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In this letter, we propose a simple rule that generates scale-free networks with very large cluster- 
ing coefBcient and very small average distance. These networks are called Random Apollonian 
Networks(RANs) as they can be considered as a variation of Apollonian networks. We obtain the 
analytic result of power-law exponent 7 = 3 and clustering coefBcient C — ^ — 361n| ~ 0.74, 
which agree very well with the simulation results. We prove that the increasing tendency of average 
distance of RAN is a little slower than the logarithm of the number of nodes in RAN. Since many 
real- life networks are both scale- free and small- world, RANs may perform well in mimicking the 
reality. The epidemic spreading process is also studied, we find that the diseases spread slower 
in RANs than BA networks in the early stage of SI process, indicating that the large clustering 
coefficient may slower the spreading velocity especially in the outbreaks. 

PACS numbers; 89.75.Hc,64.60.Ak,84.35.-l-i,05.40.-a,05.50-l-q,87.18.Sn 
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Recently, empirical studies indicate that the networks 
in various fields have some common characteristics, which 
inspires scientists to construct a general modelQ. One 
of the most well-known models is Watts and Strogatz's 
small- world network (WS network), which can be con- 
structed by starting with a regular network and randomly 
moving one endpoint of each edge with probability p |2| . 
Another significant one is Barabasi and Albert's scale- 
free network model (B A network) . The BA model sug- 
gests that two main ingredients of self-organization of a 




FIG. 1: The sketch maps for the network growing process. 
The four figures show a possible growing process for RAN at 
t = 1(a), t = 2(b), t = 3(c) and t = 4(d). At time step 1, the 
4th node is added to the network and linked to node 1, 2 and 

3. Then, at time step 2, the triangle A134 is selected, the 5th 
node is added inside this triangle and linked to node 1, 3 and 

4. After that, the triangles A234 and A124 are selected and 
node 6 and 7 are added inside them, respectively at time step 
3 and 4. Keep on the similar iterations, one can get RANs of 
any orders as he like. 



network in a scale-free structure are growth and prefer- 
ential attachment. 

A few authors have demonstrated the use of pure 
mathematical objects and methods to construct scale- 
free networks. One interesting instance is the so-called 
integer networks 5], of which the nodes represent inte- 
gers. Another related work is owed to Dorogovtsev and 
Mendes et al, in which the deterministic networks, named 
pseudofractals, are obtained by attachment aiming at 
edges 4] . Here, we focus on the so-called Apollonian 
Networks (ANs) introduced by Andrade et al6j. The 
networks can be produced as follow: start with a tri- 
angle and then at each generation, inside each triangle, 
a node is added and linked to the three vertices. Dove 
et. al. have studied the properties of ANs detailedlyQ, 
and shown the degree distribution p{k) cx k~'^, aver- 
age length I oc (InNY , where 7 = 1 -|- j2| ~ 2.585, 
(i K, 0.75 and N is the orderQ- In this letter, we pro- 
pose a simple rule that generates scale- free networks with 
very large clustering coefficient and very small average 
distance. These networks are called Random Apollo- 
nian Networks (RANs), since they can be considered as 
a variation of ANs. 

RAN starts with a triangle containing three nodes 
marked as 1, 2 and 3. Then, at each time step, a trian- 
gle is randomly selected, and a new node is added inside 
it and linked to its three vertices. The sketch maps for 
the network growing process are shown in figure 1 . Note 
that, after a new node is added, the number of trian- 
gles increases by 2. Therefore, we can immediately get 
that when the networks are of order N , the number of 
triangles are: 



iVA = 2{N -2,) + l = 2N 



(1) 



Let N'}^ denote the number of triangles containing the 

«th node, the probability that a newly added node will 
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FIG. 2: Degree distribution of RANs, with = 640000(black 
squares), N = 320000(red circles) and iV = 160000(blue 
triangles), where P{k) denotes the number of nodes of de- 
gree k. The power-law exponents are 7540000 ~ 2.94 ± 0.04, 
7320000 = 2.92 ± 0.05 and 7160000 ~ 2.92 ± 0.06, respectively. 
The average of them are 2.93. The inset shows Degree dis- 
tribution with N = 80000 (black squares), N = 40000 (red 
circles) and N — 20000(blue triangles). The exponents are 
7640000 = 2.91 ± 0.07, 7320000 = 2.90 ± 0.07 and 7160000 = 
2.90 ± 0.09. The mean value is 2.90. The two dash lines have 
slope -3.0 for comparison. 




FIG. 3: The clustering coefficient of RAN(black squares) 
and BA networks(red circles). In this figure, one can find 
that the clustering coefficient of RAN is almost a constant a 
little smaller than 0.77, which accords with the analytic re- 
sult(dash line) very well. The inset shows that the clustering 
coefficient of BA networks, which decreases with the increas- 
ing of network order quickly. 



the node 1, 2 and 3, N}^ is equal to the degree of the 
ith node: = ki. Therefore, we can write down a rate 
equation for the degree distribution. Let n{N, k) be the 
number of nodes of degree k when N nodes are present, 
we have: 



n{N+l,k+l) 



n{N,k) — +n{N,k+l){V 



k + 1 



) (2) 



When N is sufficient large, n{N, k) can be approximated 
as Np{k), where p{k) is the probability density function 
for the degree distribution. In terms of p{k), the above 



equation can be rewritten as: 

(iV+l),(^+l) . ^+Npik+1)-^^' + + ^) 



N, 



A 



(3) 

Using Equ.(l) and the expression p{k + 1) — p{k) — 
we can get the continuous form of Equ.(3): 



dp 37V-5 ^ 



(4) 



This lead to p{k) cx k'f with 7 = (3iV -5)/N ^ 3 for 
large N. Figure 2 shows the simulation results, which 
agree with the analytic one very well . 

In succession, let us calculate the clustering coefficient 
of RANs. For an arbitrary node x, the clustering coeffi- 
cient C{x) is: 



2E{x) 



k{x){k{x) 



1 



(5) 



where E{x) is the number of edges among node x's 
neighbor-set A{x), and k{x) — \A{x)\ is the degree of 
node X. The clustering coefficient C of the whole net- 
work is defined as the average of C{x) over all nodes. At 
the very time when node x is added to the network, it is 
of degree 3 and E{x) = 3. After, if the degree of node x 
increases by one(i.e. a new node is added to be a neigh- 
bor of x), then E{x) will increases by two since the newly 
added node will link to two of the neighbors of node x. 
Therefore, we can write down the expression of E[x) in 
terms of fc(x): E{x) = 3 + 2(fc(x)-3) = 2fc(x)-3, which 
leads to: 



C{x) = 

Consequently, we have: 



2(2fc(a;) -3) 
k{x)(k{x) - 1) 



C 



N ^ 



ki[ki 1) 



2 ^ 3 

-Yi--- 

i—1 



(6) 



(7) 



where ki denotes the degree of the iih node. Rewrite 



'l2iLi fi^i) i'^ continuous form, we have 



C = 6 



dk 



(8) 



k Jk„^^^ fc - 1 

denote the minimal and maximal 
degree in RAN, respectively. Note that p{k) = ak~'* 
with 7 = 3 and a a constant, one have: 



C = 6a 



k-'^dk - 2a 



1 



fc3(fc - 1) 



dk (9) 



Using the normalization equation '""^ p{k)dk ~ 1 and 
the approximate condition that k^ax 3* kmin = 3, we 



have C = f - 36ln| « 0.74. 



Figure 3 shows the simulation results about the clus- 
tering coefficient of RAN, which agree very well with 
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FIG. 4: The dependence between the average distance L and 
the order A'^ of RANs. The inset exhibits the curve where 
L is considered as a function of InA^, which is well fitted by 
a straight line. The curve is above the fitting line when A'^ 
is small(1000 < N < 3000) and under the fitting line when 
A'^ is large(A'' > 4000), which indicates that the increasing 
tendency of L can be approximated as InA'' and in fact a little 
slower than InN. All the data are obtained by 10 independent 
simulations. 



the analytic one. It is remarkable that, the clustering 
coefficient of BA networks is very small and decreases 
with the increasing of network order, following approx- 
imately C ^ In^iV/A^Q. Since the data-flow patterns 
show a large amount of clustering in interconnection net- 
works, the RAN may perform better than BA networks. 
In addition, the demonstration exhibits that most real- 
life networks have large clustering coefficient no matter 
how many nodes they have, which agrees with the case 
of RAN but conflicts with the case of BA networks, thus 
RAN may be more appropriate to mimic the reality. 

At last, let's discuss the average distance of RANs. 
Marked each node according to the time when it is added 
to the network(see figure 1), then we have the following 
Lemma: for any two nodes i and j, each shortest path 
from i to j does not pass through any nodes k satisfying 
that k > max{i,j}. The proof is a routine exercise thus 
omitted here. 

Using symbol d{i,j) to represent the distance between 
i and j, the average distance of RAN with order TV, de- 
noted by L{N), is defined as: L{N) = -^^^^7 where 

the total distance is: a{N) = J2i<i<j<N ^i^^j)- 

According to the lemma, newly adHed node will not 
affect the distance between old nodes. Hence we have: 



N 



a{N + 1) = a{N) +Y,d{i,N + 1) 



(10) 



Assume that the node 4- 1 is added into the triangle 
Ayi2/22/3, then the Equ.(lO) can be rewritten as: 



N 



a{N + l)^aiN)+J2(.Dii,y) + l) 



(11) 



where D(i, y) = min{d{i, yi), d{i, ^2), d{i, ^3)}. Constrict 
^2/12/22/3 continuously into a single node y, then we have 



Dii, v) = d{i, y). Since d{yi,y) = d{y2,y) = d{y3, y) = 0, 
the Equ.(ll) can be rewritten as: 



cr{N + 1) = a{N) + N + v) 



(12) 



where F = {1, 2, • ■ • , N} — {2/1,1/2, 2/3} is a node set with 
cardinality — 3. The sum X^ier '^(*' 2^) '-^'^ consid- 
ered as the total distance from one node y to all the other 
nodes in RAN with order N — 2. In a rough version, the 
sum X^ier '^(*' v) approximated in terms of L{N — 2): 



^d(z,y)«(7V-3)L(iV-2) 



(13) 



Note that, the average distance L{N) increases 
monotonously with A^, it is clear that: 



{N -3)L{N -2) = 



2(t{N 



_2) ^ 2a{N) 



N 



(14) 



Combining Equs.(12), (13) and (14), one can obtain the 
inequality: 



(7{N + 1) < a{N)+N + 



2a{N) 
N 



(15) 



Consider (15) as an equation, then the increasing ten- 
dency of (j{N) is determined by the equation: 



dN 

This equation leads to 



daiN) _^ , 2aiN) 



N 



a{N) = N^lnN + H 



(16) 



(17) 



where ff is a constant. As a-{N) ~ N'^L{N), we have 
L{N) ~ InN. Which should be pay attention to, since 
(15) is an inequality indeed, the precise increasing ten- 
dency of L may be a little slower than InA^. 

As we mentioned above, close to many real-life net- 
works, random Apollonian networks are both scale-free 
and small-world. Therefore, it is worthwhile to investi- 
gate the processes taking place upon RAN and directly 
compare these results with just small-world and just 
scale-free networks. One widely studied process is the 
epidemic spreading process 0; and for the sake of pro- 
tecting networks and finding optimal strategies for the 
deployment of immunization resources, it is of practi- 
cal importance to study the dynamical evolution of the 
outbreaks [Hill. Numerical simulations aiming at BA, 
and RAN are shown in figure 5 with the pro- 
cess and the conception of density and velocity all the 
same as previous studies mill. The result that diseases 
spread more quickly in RAN and BA networks than in 
NW networks is easy to be understood as the well-known 
conclusion: boarder degree distribution will speed up the 
epidemic spreading process [Toj. 

Why the diseases spread more quickly in BA networks 
than RAN is a very interesting question. We argue 
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t {time steps) i (time step) 



FIG. 5: The left and right plots show average density of 
infected individuals and spreading velocity versus time with 
N = 10000 and average degree (fc) = 6 fixed. The black, red 
and blue curves correspond to the case of BA, RAN and NW 
networks respectively. The NW networks are of z = 1 and 
= 4x 10~^, thus {k) ~ 2z+(j>N = 6^^{see also the accurate 
definitions of z and cf> in reference [ij])- The spreading rate 
is A = 0.01. All the data are averaged over 10^ independent 
runs. The spreading velocity reaches a peak quickly. Before 
the peak-time, the spreading velocities of the three kinds of 
networks satisfy the inequality iif^^ > t'i^/^ > t'i^ ■ 

that the larger clustering coefficient may slow down the 
spreading process especially in the outbreaks. Remove an 
arbitrary edge e{x,y) from quondam network, then, the 
distance between x and y will increase as d!{x,y) > l(if 
the removal of e makes x and y disconnected, then we set 
d'{x, y) — N). The quantity d'{x, y) can be considered as 
edge e's score s(e) = d'{x, y) > 2, denoting the number of 
edges the diseases must pass through from a; to y or form 
y io x\i they do not pass across e. If s(e) is small, then e 
only plays a local role in the spreading process, else when 
s(e) is large, e is of global importance. For each edge e, if 
it does some contribution to clustering coefhcient, it must 
be contained in at least one triangle and s(e) — 2. There- 
fore, networks of larger clustering coefficient have more 



local edges. RANs and BA networks are two extreme 
cases of scale-free networks. In RANs, all the edges are 
of score 2; while in BA networks, almost all the edges are 
of score larger than 2 because the clustering coefficient of 
BA networks will decay to zero quickly as N increases. 
Consequently, diseases spread more quickly in BA net- 
works than RAN. The above explanation is qualitative 
and rough, to study the process upon networks with tun- 
able clustering coefficient^^ may be useful, which will 
be the future work. 

In respect that the RANs are of very large clustering 
coefficient and very small average distance, they are not 
only scale-free, but also small-world. Since many real-life 
networks are both scale-free and small-world, RAN may 
perform better in mimicking reality than WS and BA 
networks. The epidemic spreading process is also stud- 
ied, in which the diseases spread slower in RAN than 
BA networks during the outbreaks, indicating that the 
large clustering coefficient may slower the spreading ve- 
locity especially in the outbreaks. This numerical study 
suggests that the clustering structure may affect the dy- 
namical behavior upon networks much. Further more, 
many real-life networks are planar networks by reason 
of technical or natural requirements, such as layout of 
printed circuits, river networks upon the earth's surface, 
vas networks clinging to cutis, and so forth. Since the 
number of edges in RAN is equal to 3iV — 6, RAN are 
maximal planar networks,!^, which are possibly of par- 
ticular practicability for layout of printed circuits and so 
on. 
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